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Abstract 

Given A C N, we write rd(n;A) and Sd{x]A) to refer to the number of solutions 
to ki + k2 + ■■■ + kd = n and ki + k2 + ■ ■ ■ + kd ^ x respec., where d ^ 1 is fixed and 
all ki G A. In this paper we study the asymptotic behaviour of rd(n; A) and Sd{x;A) 
when IA fl [0,a;]| can be estimated by O-regularly varying functions. We then give 
necessary and some sufficient conditions for A to form an asymptotic basis. 

An asymptotic /i-basis is often called economic when rh{n] B) <C£ ■ In view of 

Erdos-Tetali theorem on economic bases, that ensures the existence of h-hases B with 
rfi{n;B) x log(n), we describe a general method to study random sequences w C N 
with a given asymptotic behaviour |w H [0,a;]| ^ fix) for certain /s. An explanation 
of how log(n) appears as a natural limitation for standard probabilistic approaches to 
this problem is provided once we derive the existence of asymptotic ft.-bases satisfying 
rdin; B) x f'in)fin)^~^ for every d ^ h. 

Finally, we present an elementary combinatorial heuristic approach that predicts, 
except for a constant, Hardy-Littlewood’s extended Goldbach’s conjecture. 
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1 Introduction 

In this paper we shall call S the set of subsequences of the sequence of nonnegative 
integers, i.e. the set of monotone strictly increasing nonnegative integer sequences, or 

Definition 1.1 (Space of nonnegative integer sequences). 

S:={AC Z^o : |^| = i^o}. 

A central problem in additive number theory is to decide whether a subsequence of 
the sequence of nonnegative integers is a finite basis or asymptotic basis. We will describe 
some general methods that uses the asymptotic density of a sequence to decide this. All 
information will be derived in terms of the representation functions we are about to define. 


1.1 Representation functions 


Definition 1.2 (Representation functions). Let A € 5 and d ^ 2 an integer. For every 
nonnegative integer n we define 


• r(n; A) = ri(n; A) 


1 if n G A, 
0 if n ^ A. 


• rd{n;A):= ^ r{ki; A)r{k 2 ] A).. .r{kd; A) 

ki+k2+...+kd=n 


and, for every nonnegative real number x, 

• s{x; A) = si(x; A) := |A n [0, x] | 

• Sd{x;A) := '^rd{n]A). 

n^x 

That is, r(n;A) works like a characteristic function of A in whereas s{x\A) 

counts the number of elements of A up to x. Thus, rd{n; A) is the number of solutions to 
ki + k 2 + ■ ■ ■ + kd = n with kj G A considering permutations, and Sd{x; A) the number of 
solutions to ki + k 2 + ■■■ + kd ^ X with kj G A also considering permutations. We may 
illustrate this via the formal series 

A . (1.1) 

n^O \keA J 

One, however, should not be bothered by the fact that our representation functions 
count repetitions. In fact, defining 

n 

{/ci, A: 2 , ..., fed} c A : ^ kj = n 
i=i 

and s^(x;A) := A), it is easily seen that r'^{n-,A) x rd{n;A) given that 

rd{n; A)/d\ ^ r^{n;A) ^ rd{n\A). The same, consequently, holds for Sd and so results 
about growth orders can be easily translated from one definition to another. It is worth 
noting that we use a lot of asymptotic notation in this paper, all standard on literature. 

The first interesting thing that can be said about our representation functions is that 
these are related by some recurrence relations. 


r*d{n]A) : = 


2 




Proposition 1.3 (Recursive formulas). Let A ^ S, n ^ d ^ 2. Then for every 

l^l<d: 

(i) rrf(n; A) = Jff^^^rd-i{t, A)ri{n - k;A) 

(ii) Sd{n;A) = Y.k^n'^d-i{k] A)si{n - k]A) 

(in) Sd{n] A) = Ylk^n Sd-i{k; A)ri{n - k; A). 

Proof. The item (i) cames easily from the formal series (1.1). For (ii), by (i): 

n k 

Sd{n;A) = EE rd-i{j;A)ri{k - j;A) 

k=0 j=0 
n n 

= EE rd-i{j-,A)ri{k - j;A) 

j=0 k=j 

Similarly, for (iii): 

n k 

Sd{n;A) = EE rd-l{k - j-,A)ri{j;A) 

k=0 j=0 
n n 

= EE rd-i{k - j; A)ri{j-, A) = '^Sd-i{j; A)ri{n - j] A). ■ 

i=0 k=j j=0 

Usually when we cite these formulas we will be referring to the case 1 = 1. Note that 
these formulas explain in part why rd{n;A) is generally easier to deal rather than the 
version without repetitions. As an example, take Z^o- It is easy to deduce that 

rd (n;Z^o) = 

for d ^ 2, whereas the version without repetition is well-known to be more difficult to 
deal. We also have a naive way to convert information from Sd to r^, namely 

Definition 1.4 (Mean value of r^). Let A € 5 and d ^ 1 an integer. For every nonnegative 
real number x we define 

- / 4 N Sd{x-,A) 

rdix-,A) := -. 

X 


n -|- d — 1\ 

, d-1 J’ 


= '^rd-i{j-, A)si{n - j; A). 

j=0 


1.2 Asymptotic notation 

Throughout this paper, our asymptotic notations will always refer to x —>■ -|-oo through 
the reals and n ^ -|-oo through the integers, unless otherwise specified. 

We say that a real function / is asymptotically defined when it is defined in [a, -|-oo) 
for some a ^ 0. Whenever we write an ”a” that is not specified in the context, there will 
be one or more asymptotically defined functions within the context, and the ”a” will be 
referring to the least a > 0 such that all of those are well-defined. 

We write f^g, gl$>for:f = 0{g) when 


lim sup 

x—>-+oo 


fix) 

gix) 


< -too; 


3 






and f<g,g>-f or f = o{g) when lim f{x)/g{x) = 0. When these notation came with 
a snbscript e it means that the estimate is valid for all e > 0. For example, f{x) = 
Os{gix,e)) means that for every e > 0 holds limsup^, \ f{x)/g{x,e)\ < +oo. 

When f ^ g and 5 <C / we write f g or f = @{g), that is equivalent to 

3M > O:\fx > M,3ci,C2 > 0 : ci\g{x)\ ^ |/(x)| ^ C 2 | 5 (x)|; 
and f ^ g when lim f (x)/g{x) = 1 . 

Let us now talk a bit about regularly varying functions. An extensive treatment of 
this kind of functions may be found in N. H. Bingham’s et al. book Regular Variation. We 
state the next definition with Karamata’s characterisation theorem in mind (for reference, 
see Theorem 1.4.1 from [2]). 

Definition 1.5 (Regular variation). Let / be an asymptotically defined positive real¬ 
valued measurable function. We say that / is: 

Slowly varying : VA > 0, /(Ax) r\j fix)] 

Regularly varying : 3/3 G M : VA > 0, /(Ax) ~ A^/(x); 

O- regularly varying : VA > 0, /(Ax) X fix). 

The kind of functions that we will treat, however, are the following. 

Definition 1.6 (Type-1 function). A type-1 function is an asymptotically defined real¬ 
valued function / that is positive, monotonic and O-regularly varying. 

Definition 1.7 (Type-2 function). A type-2 function is an asymptotically defined real¬ 
valued differentiable function / that diverges, such that both / and f' are positive, mono¬ 
tonic and O-regularly varying, with f being bounded. 

We will need functions like this to deal with the kind of convolutions that will show 
up. The reader should no be bothered by these apparent ad hoc assumptions, for the 
majority of interesting cases are within the conditions we shall assume throughout this 
paper, like x“ log(x)^ and iterated logs, for example. 

Having done the setup, we now describe briefly the content of the paper. 

1.3 Sketch of the sections 

In Section 2 we discuss general aspects of the growth of the representation functions, 
starting by showing that Sdix]A) >- Srf_i(x;A) for all A G <S. We then describe an 
integral formula relating Sdix\A) to Sd-iix]A) when s{x]A) ~ /(x) for some type- 
1 function. Later, we explore some ideas from L. G. Schnirelmann described by M. B. 
Nathanson in [10] to state sufficient conditions to ensure that a certain sequence A G <S 
forms an asymptotic basis. We finish by remarking on some limitations of our methods 
via counterexamples. 

In Section 3 we explore a commonly used probabilistic construction that assigns a 
probability to decide whether a sequence contains or not a certain n G by taking 
r(n, u) to behave as a Bernoulli trial. Then, with all our representation functions behaving 
as random variables, we study the structure of this space related to these r.v.s, specifically 
the asymptotic behaviour a.s. of Sd and of the expected value of r^/, both in terms of 
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type-1 and type-2 functions /. The setup of the work of P. Erdos and P. Tetali in [5] is 
included within the case when f{x) = log(x)^/'^. 

Still on Section 5, our results lead to the concept of /-regular and 0(/)-regular asymp¬ 
totic bases. We reinterpret Erdos-Tetali theorem and the P. Erdos’ ’’strong Erdos-Turan” 
conjecture (which was first presented in [4]) in terms of 0(/)-regular asymptotic bases, 
remarking the motivation of these by the technical limitations of the ’’exponential bounds 
and Borel-Cantelli’s lemma” approach to the economic bases problem. 

Finally, on Section 4 we present an elementary combinatorial heuristic argument that 
motivates a conjecture almost as strong as Hardy-Littlewood’s extended Goldbach’s con¬ 
jecture. The probabilistic construction to be used there will be substantially different 
from the one we shall describe in Section 3. We use, basically, calculus, the PNT, esti¬ 
mates of Euler’s totient function ip and the distinct prime divisors counting function a;, 
a exponential bound for the hypergeometric probability distribution and Borel-Cantelli’s 
lemma. As one should expect, we do not deduce exactly Hardy-Littlewood’s extended 
Goldbach’s conjecture, but with considerably fewer methods we managed to miss it only 
by a constant! 

2 Asymptotic behaviour of asymptotic bases 

Our main results are Theorems 2.5, 2.6 and Corollary 2.11. 

2.1 General approach 

With the recurrence formulas in Proposition 1.3 we can start to relate the growth of 
the Sd function for d ^ 2 with the growth of s{x] A) for a fixed A € 5. 

Lemma 2.1. Let A € 5 and x G Mj>o. For every d ^ 1, 

s{x/2]AY ^ Sd{x]A) ^ s(x;A)'^. 

Proof. It obviously holds for d = 1, so we proceed by induction. We have 

Sd{n] A) = ^ Sd-i{k-, A)r(n - k; A) 

k^n 

^ Sd-i{n; A) ^ r(n - k] A) 

k^n 

= Sd-i{n] A)s{n; A) ^ s(n;A)'^, 


and 


Sd{n;A)^ ^ Sd-i{k; A)r{n - k] A) 

n/2^k^n 

^ Sd-i{nf2;A) ^ r{n-k]A) 

n/2^k^n 

= Sd-i(n/2; A)s(n/2; A) ^ s(n/2; A)^ (2.1) 


as required. 
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Note that this result implies that Sd{x; A) x s(x; AY when s(x; A) is 0-regularly 
varying. Actually, we can state this in a quite more general form: 

Theorem 2.2. Let / be 0-regularly varying and A € 5 a sequence. Thus, 


If s(a:; A) □ /(x), then Sd(x; A) □ f{xY, 


where can be any of the asymptotic signs x, <C, -<. 


Proof. Follows immediately by Lemma 2.1. 


in order to a given sequence A € 5 be an asymptotic basis, a necessary condition is to 
have rd(x', A) ^ 1 for some d ^ 1, otherwise rd{n\ A) will certainly be 0 for infinite values 
of n. Put differently, we deduce from these considerations that 

Corollary 2.3. If A € 5 is an asymptotic basis, then s(x; A) ^ x^ for some e > 0. 

Proof. Otherwise, by Theorem 2.2, we would have Sd{x;A) <C£ x^ for every d ^ 1, that 
inhibits rd{n] A) from being > 0 for all sufficiently large n. ■ 

We will describe now, however, a result that holds in general, but first we need another 
lemma. 

Lemma 2.4. Let A G 5 be a sequence. If L is an asymptotically defined positive non¬ 
decreasing function such that L{x) -< s(x; A), then, for every integer d ^ 1, 

Srf(x; A) ~ Sd{x - L(x); A). 

Proof. We proceed by induction on d. When d = 1, taking sufficiently large x: 

s(x; A) = ^ r(n; A) ^ ^ r{n-, A) + L{x) = s{x — L{x)-, A) + L{x) 

riiix n!ix—L{x) 

So s(x;A) ~ s{x — L{x)\A). Assuming now that Sd-i{x;A) ~ — L(x);A), 

follows, taking sufficiently large n. 



^ sa-i{k'. A)r{n- 


k-,A)+ Y. Sd-i{k-,A)r{n - k;A) 


L(n)^k^n 


k<L{n) 


71 (n) 


72 (n) 


Now, observing each term separately, we have 


7 i(n) > s{n - L(n); A)srf_i(L(n); A) ~ s{n-, A)sd-i{L{n)] A) 


and 


12 (^ 1 ) < (s(u; A) - s{n - L{n); A)) Sd-i{L{n)] A) ^ s{n-, A)sd-i{L{n); A) 
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thus 7 i(n) 72 (?^)- We conclude: 

Sd{n;A)r~u ^ Sd-i{k; A)r{n - k] A) 

L{n)^k^n 

~ ^ Sd-i{k - L{n); A)r{n - k; A) 

L{n)^k^n 

= ^ Sd-iik]A)r{n - k - \L{n)];A) = Sd{n - lL{n)']-,A). 

k^n—\ L{n)~\ 

Given that Sd and L are non-decreasing, we conclude Sd(x; A) rsj Sd{x - L{x)]A). ■ 

Theorem 2.5. If ^ G <S is an arbitrary sequence, then Sd{x]A) >- Sd-i{x',A) for every 
integer 2. 

Proof. Take a real-valued function L satisfying the hypotheses from Lemma 2.4 such that 
L{x) >- 1. Such a function, in fact, exists. We can take, for example, for some e > 0, 

L{x) := s{x-, AY~^. 

Cleary, such L{x) satishes 1 ^ L{x) ■< s(x; A). Now proceeding we have, from Lemma 
2.4, for every integer d ^ 2, 

Sd{n; A) = '^ Sd-i{t, A)r(n - k; A) 

k^n 

> ^ Sd-i{n - k; A)r{k; A) 

k^L{n) 

^ Sd-i{n - L{n); A)s{L{n); A) > Sd-i{n; A)s{L{n); A) 

Given that both s{x-,A) and L{x) diverges, we have s{L{x)]A) >- 1. Hence, we deduce 
Sd{x-,A) Sd_i(x; H)s(s(x; H), and a fortiori, Sd{x;A) >- Sd-i{x] A), as required. 


We have just proved that fd{x;A) >- fd-i{x;A) for arbitrary sequences A. In plain 
English, the growth order of the average number of ways to write N as the sum of d 
elements of any given sequence is ’’bigger” than the growth order of the average number 
of ways to write N as the sum of d — 1 elements of the same sequence. 

2.2 Integral formula for Sd 

Based on Theorem 2.2, we now study the case when the asymptotic sign is that 
is a bit more delicate than the others. For the sake of generality, we shall consider an 
error term 0{e{x)) on the approximation of s(x;H) by some function /. 

Theorem 2.6 (Integral formula for Sd)- If A G S is such that s(x;H) = f{x) + 0{e{x)), 
with f being and e{x) = o{f{x)), both type-1, then for every integer d ^ 1 holds 

Sd{x; A) = f Sd-i{t; A)f{x - t)dt + 0{f{xY~^e{x)). 

J a 
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Proof. By Theorem 2.2, consider the following: 


'^rd-i{t,A)s{n -t,A) ^ Sd-i{a; A)s{n; A) < /(re), (2.2) 

k^a 

^ rd-i{t,A)s{n-t,A) ^ Sd-i{n-,A)s{a;A) (2.3) 

n—cx<k^n 


Then, by the recursive formulas, one can deduce* 

Sd{n-,A) = '^rd-i{k-,A)s{n-k-,A)= ^ rd-i{k-, A)f{n - k) + 0{f{n)'^~^e{n)) 

k^n a<k^n—a 

Now we apply Abel’s summation formula: 


r iL —ct 

Srf(re; A) = [sd-i{t] A)f{n - t)]”"" + / Sd-i{t] A)f'{n - t)dt + 0{f{nY~^e{n)) 

J a 

rn—a 

= / Sd-i{t]A)f'{n-t)dt + 0{f{n)‘^~^e{n)). (2.4) 

J a 

By Theorem 2.2 we already have Sd{x;A) x f{xY, that is f{nY~^e{n). Thus, 
considering that Sd{x;A) = S(i([xJ; A), we deduce from (2.4): 


Sd{x\A)=( Sd-i{t]A)f{x-t)dt + 0{f{nY^e{n)). ■ 

J a 

Comparing to the recurrence formulas on Proposition 1.3, this theorem is saying that, 
asymptotically, r(n;A) behaves, in some way, very like to f'{n). This intuition shall be 
strongly used in Section 3. Anyway, one might have expected to get 


Sdix;A) ~ Cf^df{xf 


(2.5) 


for some constant depending on / and d. This, however, depends whether or not 


f^-^f{tr-^f{x-t)dt 
x^+oo fix)^ 


( 2 . 6 ) 


exists for integers 1 ^ m ^ d. If so, then Cj^d = 11^=1 where Cf^m are the limits 

( 2 . 6 ). 

One can show, for example, that if Q is the sequence of perfect squares, then holds 
S 2 {x;Q) ~ jx; or if P is the sequence of prime numbers, then S 2 (x;P) ~ ^ x^/(logx)^. 
However, I could not decide a general condition over / that ensures that the above limit 
converges. Perhaps regularly varying type-1 functions satisfy this. 


2.3 Sufficient conditions to form asymptotic bases 

A way to approach this topic is by studying the Schnirelmann density of sequences. 
However, some sequences does not have 0,1 as elements of it, or are not well distributed 
over the residue classes of some integers (e.g. the sequence of even numbers). We shall 
give some definitions to ensure the generality of the results of this subsection. 

‘When e{x) is decreasing, the last step can be done splitting the sum into k ^ n/2 and k > n/2, 
similarly to (2.1). 




Definition 2.7 (Degenerate sequences). Let A ^ S he a sequence. Thus we define 


gcd(^) := min gcd(a, 6). 
a,b£A 


When gcd(^) > 1, we say that ^ is a degenerate sequence and its nondegenerate 
associated sequence is given by A* := {a/ gcd(^) : a € A}, li A ^ A* and A* is an 
asymptotic basis, then we say that ^ is a degenerate asymptotic basis. 


This notion of degenerate sequence prevents us from discarding some sequences from 
our study, as the sequence of even numbers, for example. Nevertheless, it does not say 
anything about sequences like the sequence of odd numbers, so one can be interested in 
to write a sharper definition of degeneracy. Anyway, this dehnition will be sufficient to 
supply our necessities for now. 

One can see that degenerate asymptotic bases are precisely those sequences such that 
there is an inhnite arithmetic progression where any element of it can be written as 
the sum of a fixed number of elements of the sequence. In fact, if A is a degenerate 
asymptotic basis such that every n ^ no can be written as the sum of h elements of its 
nondegenerate associated sequence A*, then every number on the arithmetic progression 
{gcd(A)no + gcd(A)/c : k € ’^^o} can be written as the sum of h elements of A. 

The notations 

A 13 != {n T ^ n G A, b G 33^^ 
dA != A -j- A T ... T A = -[n G -^) ^ d}; 

'-V-' 

d times 

for A,B(^S and d ^ 1 are very common in literature, and will be useful on the proof 
of our next result. Firstly, let us state Schnirelmann’s Theorem on arbitrary sequences 
containing 0,1. 


Theorem (Schnirelmann). Let A £ S be a sequence such that 0 G A. 
density of A is given by: 


a{A) := 


inf 

n=l,2,3... 


s(n; A) 
n 


The Schnirelmann’s 


If (j{A) > 0, then A is a finite basis. 


A proof of this theorem may be found in [7] and [10]. Using this theorem, L. G. 
Schnirelmann was able to prove that every number greater than or equal 2 can be written 
as the sum of a bounded number of primes. Following his line, we are interested in to give 
sufficient conditions to assure that every sufficiently large number can be written as the 
sum of a bounded number of elements from a given sequence. We start by the following 
lemma. 


Lemma 2.8. If A £ S is a nondegenerate sequence with f(x; A) 3> 1, then A U {0} is an 
asymptotic basis. That is, every sufficiently large integer is a sum of a bounded number 
of elements of A. 

Proof. Firstly, r{x] A) ^ 1 implies that 

, „ s(x: A) 

hmmf- > 0. 

X—>' + CXD X 


9 





Thus we have a{A U {0,1}) > 0, so, by Schirelmann’s theorem, A U {0,1} is a finite 
basis of order, let’s say, hi. 

Now, A is nondegenerate, so let a,b € A with gcd(a, 6 ) = 1. Given that the set 
6,26,..., ab (mod a) forms a covering system, we deduce that every integer n ab can 
be written in the form n = li{n)a + l 2 {n)b, where li{n),l 2 {n) G with hin) < a. Let 

6 , 2 := max {h(n) + l 2 {n)) 

ab^n^ab^hi 

If N > ab, then N — ab can be written as the sum of at most hi elements of ^ U {!}, 
that is, there are k,l a Z^o such that 

N — ab = 1 + 1 + ... + 1 +ai + 02 T ... + u;, 

^ V 

k 

where ai,a 2 , ■ ■ ■ ,ai € A and k + I ^ hi. Hence: 

N = li{ab + k)a + l 2 {ab + k)b + oi + 02 + • • • + «/• 

That is, given that k,l ^ hi, we can write N as the sum of 

I lii^ab T k^ T ^ 2(^6 T k'j ^ hi T h2 

elements of A, a fixed number. This completes the proof. ■ 

Proposition 2.9. If A ^ S is a nondegenerate sequence such that r{x;dA) ^ 1 for some 
d ^ 1, then A U {0} is an asymptotic basis. 

Proof. If dA is nondegenerate, then by the previous result the proof is complete, since 
dHu{ 0 } Cd(Hu{ 0 }). 

Suppose then that dA is a degenerate sequence. Let’s consider its nondegenerate 
associated sequence {dA)*. It is immediately apparent that s{x;{dA)*) ^ s{x-,dA) for 
every x ^ 0, therefore r{x; {dA)*) S> 1 and, by Lemma 2.8, dA is a degenerate asymptotic 
basis, let’s say, of order h. As we previously discussed, there will be an infinite arithmetic 
progression of common difference r := gcd(dA) in the sequence hdA, let us say 

P := {no + kr : k & 

Now let us divide the problem into two cases: 

Case 1. 

If there is an element a & A such that gcd(a, r) = 1, then given that the set a, 2a,... ,ra 
(mod r) forms a covering system, we deduce that every integer UQ + ar can be written 
in the form n = t + ak, where t ^ P and A: < r is a nonnegative integer. That is to say, 
every sufficiently large integer can be written as the sum of at most hd + r elements of A, 
so this case is done. 

Case 2. 

If there is no such element in A, given that A is nondegenerate, take a,b ^ A such 
that gcd(a, 6 ) = 1. The set 6 , 26,..., ah (mod a) forms a covering system, so every integer 
n ^ ab can be written in the form n = li{n)a + l 2 {n)b, where li{n),l 2 {n) G Z^o with 
h{n) < a. A fortiori, there is a sufficiently large integer c coprime with r that can be 
written in the form c = jia + j 2 b, for some nonnegative integers ji,j 2 - 

Setting j '■= ji + j 2 , we see that c G jA, so, by a similar argument to the one in the 
previous case, every sufficiently large integer can be written as the sum of at most hd + jr 
elements of A. This completes the proof. ■ 
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The next theorem and corollary captures a very important idea: in order to a given 
sequence be an asymptotic basis, not only the average value of must be big, but the 
’’variance” of around the mean can’t be too high. 

Theorem 2.10. Let A ^ S he a nondegenerate sequence such that s{x] A) ^ for some 
e > 0. If, for some integer Xje holds 

i 4^2 Sdix'jA)"^ 

y rain] A) <-, 

X 

n^x 


then A U {0} is an asymptotic basis. 

Proof. The first condition is a necessary condition by Corollary 2.3. We take d ^ 1/e, 
thus rd{x;A) ^ 1. By the Cauchy-Schwarz inequality, we have 



^ s{x;dA)'^rd{n-,Af 

n^x 


So, by our hypotheses. 


s(x; dA) ^ 


rdin-,A)f xsd{x;A)'^ 
En^.rd{n-,Ay » Sd{x-,Ar 


That is, r{x\dA) S> 1. We then apply Proposition 2.9. 


The corollary follows naturally. 

Corollary 2.11. Let ^ G 5 he a nondegenerate sequence. If for some integer 1 holds 


1 < rd{n-A) < rd{n;A), 

and in addition rd{x; A) x f[x) for some non-decreasing real function f, then A U {0} is 
an asymptotic basis. 

Proof. The condition rd{n;A) S> 1 is equivalent to s(x;^) ^ for some e > 1/d, as we 
previously discussed. Obviously, rd{n; A) ^ 1 is as well a necessary condition to A be an 
asymptotic basis. 

The remaining hypothesis, rd{n;A) <C fd{n',A), implies: 

^ rd(n; Af < ^ fd(n; Af x ^ f{nf < xf{xf x xfrf(x; Af = 

n^x n^x n^x 

Therefore the remainder of the proof is reduced to the case of Theorem 2.10. ■ 

The way that M. B. Nathanson (see [10]) describes the proof that every number 
greater than or equal 2 is the sum of a bounded number of primes involves to prove that 
the sequence of primes satisfies the conditions of Corollary 2.11 for d = 2, using methods 
of sieve theory. Y. V. Linnik (see his book Elementary methods in analytic number theory) 
shows, in the solution of Waring’s problem, that the sequences of perfect A:-powers satisfies 
the conditions of the corollary as well. The direct usage of this corollary is a nice shortcut 
to these proofs, although not very effective, since the hard work is actually to show that 
these sequences satisfy the corollary’s hypotheses. 
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2.4 Some limitations 


From the previous results, two naive questions raise up naturally: 

Question 1. Is s{x] A) 0-regularly varying for every A G 5? 

Question 2. Does every nondegenerate sequence A G 5 with s(x; ^4) ^ for some e > 0 
form an asymptotic basis? 


The answer to both questions is negative. In H. Halberstam and K. F. Roth’s book 
(see [7]), a counterexample to Question 2 it is credited to A. Storhl, in which is shown 
that, given A G <S with s(x; A) = o(x), no asymptotic lower bound to s(x; A) can by itself 
ensure that A is an asymptotic basis. We shall present a counterexample to Question 1. 

Counterexample to 1. Let C := {n^ : n G be the sequence of perfect cubes. We take 

A := C U {(22")3 + A: : 0 ^ ^ n G Z^oj ■ 

From (n + 1)^ — for n ^ 0, we see that our sequence is obtained by adding 

successors of cubes of numbers of the form (2^ to C. Hence, 

^ (22'')2^.(n;A)-nV3^ ^ (22'')2 + n^/^, 

fc<log2 log2(J)-1 fc<log2 log2 (J )-1 

where equality is obtained on both sides infinitely many times, on the left when n = (2^^)3 
for some /c ^ 0 and on the right when n = (2^*)3 + (2^ )^. Let the sequences an ■= (2^")3 
and bn ■= (2^")3 + (2^"’)^. Estimating the sum: 

^ (2^'^)^ ^ log 2 log 2 ([re^''^J) < n^/^loglogn. 

fc^log2 log2(Lnl/3J)-l 


Then follows: 




lim inf ^ = lim inf ■ 


E 

fc^log2 log2(L6n^^J)-l 


(2'=)2 + 6^/3 


and 


1/3 , 
dn + 


s{an]A) 

lim sup —-= lim sup • 


.2/3 


E ( 2 "=)" 

A:<log2 log2(Lan^’^J)-l 


> 0 


< + 00 . 


1 /S 

On ""log log On an log log 

That means that holds s{an',A) ay^logloga„ and s{bn]A) S> bn^^■ As 2an > bn, 
we have 


'S(2an)A) ^ sipn] A) ^ 


bT .. (( 22'^)3 + ( 22 ") 2 ) 2/3 22 


s{an', A) s{an',A) log log i 


> 


22- 


• n 


»- ^1, 

n 


so s(2a;;A) s(x;A), and we conclude that s(x;A) is neither O-regularly varying nor 

X f{x) for some O-regularly varying /. ■ 

^A. Stohr, Geloste und ungeloste Fragen iiber Basen der natiirlichen Zahlenreihe I,’ J. reine 
angew. Math. 194 pp40-65 §7, 1955 
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3 Probability measures on S 


In this section we shall equip S with a convenient probability measure relative to a 
certain sequence. In this probability space, almost all sequences will have a counting func¬ 
tion asymptotic to a certain given function, and this will give us tools to prove existence 
theorems by probabilistic methods, like this (see [5]): 

Theorem (Erdos-Tetali). For every integer h ^ 2, there exists an asymptotic h-basis B 
such that 

rh{n]B) X log(n). 

That is, in some sense, there is an economic or thin asymptotic /i-basis, since there is 
not many ways to write a sufficiently large number as sum of h elements of B, and still it 
is an asymptotic /i-basis. 

Most of the techniques used in the original paper of P. Erdos and P. Tetali revolve 
around disjointness and independence, for low correlation plays a starring role in such 
probabilistic constructions and is commonly dealt with some kind of ’’exponential bounds 
and Borel-Cantelli’s lemma” approach. 

The main results are Corollaries 3.5, 3.9 and Corollary 3.15 along with Theorem 3.16. 
In our discussion about Erdos-Tetali, Conjecture 3.17 arises naturally. 

3.1 Constructing (iS, /u) 

We will state now a definition that follows from a construction. The performability of 
such construction is explained in chapter III of [7] with more details. In this section we 
will denote by w G 5 a sequence chosen at random. 

Definition 3.1. Given a real sequence {an)n^o satisfying 0 ^ ^ 1 for all n G Zj>o, we 

write {S, {an)n^o) to denote the probability space {S,iF,p) satisfying the following two 
properties: 

• For every n ^ 0, the event B^'^l := {w G iS : n G w} is measurable and 

• The events B ^^'>, B ^^'>, ,.. . are independent. 

Thus, we may see r(n;Lo) as Bernoulli trails, with Pr(r(n;a;) = 1) = In this way, 
the r(n;w) are doing the role of the characteristic functions of the events B^"'\ modeling 
an infinity coin flip bet, with some kind of special coins. Notice that the random variables 
r{n;uj) are independent as well. 

We define T by the a-algebra generated by the vr-system generated by the events 
2 ^( 0 ) ^ B'^‘^\ .... The elements of this vr-system is often called cylinder events. We now 

state two lemmas that can be found in [6]. 

Lemma 3.2 (Kronecker’s lemma). Let (a^jn^i a real sequence such that X]n>i con¬ 
verges. If {mn)n^i is a positive unbounded increasing real sequence, then 

'^OkTUk -< rUn. 
k^n 

Lemma 3.3 (Kolmogorov’s two-series theorem). Let (n,W,/r) a probability space and 
(Xfc)fc^i a sequence of independent discrete random variables. If both X]fc>iIE(Wfc) and 
^^^iVar(Xfc) converges, then, with probability 1, converges. 
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Using this we will be able to capture the idea behind the construction of the probability 
spaces (5, {an)n^o)- 

Theorem 3.4. Let (an)n^o be a sequence of probabilities such that «n diverges. 

Then, in (S, (an)n^o)> holds a.s. 

s(x;cj) ~ E(s(3:; uj )), 


that is, with probability 1. 

Proof. Firstly, notice that E(s(a:;a;)) = Now define, for every n ^ no, where no 

is the least integer for such that > 0, 

__ r{n;uj) - an 

■^n •— 

2 

Routine calculations lead us to E(X„) = 0 and Var(X„) = Notice that 

we applied a linear transformation to r(n]uj), so the Xn are still pairwise independent for 
n uq. Also, 


O^n Otn ^ ^ ^ ^ 

(Efc^n«fc)^ (Efcs:n«fc)^ Efc^n-l«fc Efc^gn ’ 

SO we can give an upper bound to the sum En^no and telescope it, showing that 

it converges, then by Lemma 3.3 we have that En^no converges with probability 1. 

Now, noticing that (Efc^n'^fc)„>o unbounded increasing sequence, we apply 

Lemma 3.2 to Xn in order to get that, with probability 1, 


k^x k^x \ j^k 


'^{r{k]uj) - ak) = s{x-,oj) - ^afc. 


k^x 


k^x 


that is equivalent to s(x;uj) ~ Efc^a;®^! required. ■ 

Corollary 3.5. Let f be an asymptotically defined positive function that diverges with 
f being monotonic. If {an)n^o is sequence of probabilities with Un ~ f'{n), then, in 
{S, (an)n;?o) holds a.s. 

s{x;uj) ~ /(x), 

that is, with probability 1. Furthermore, if in addition f is type-1, then by the integral 
formula for s^ (Theorem 2.6) holds, for almost all uj £ S, 

rx—ot 

Sd{x-,uj) / Sd-i{t;uj)f'{x -t)dt, Vd ^ 2. 

J a 

Proof. Using that f is monotonic, 

E(s(x;w)) = ~ ^ ~ ^ f'{n) j f'{t)dt f{x). 

n^x a<n^x a<n^x ^ 

As / diverges, so does En>o hence the claim follows from Theorem 3.4. ■ 
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With this, we can to give a measure /r to (5,W) where almost all sequences ta £ S 
with respect to p satisfy, for example, s(x; w) ~ x/logx, namely (<S, (1/logn)„^ 3 ). Note 
that in this case we are only interested in the asymptotic behaviour of {an)n^o, so we can 
also use, for example, the sequence (an)n^o with := 0 when n < e and an ■= 1/logn 
otherwise. In other words, almost all sequences of this space are in some sense prime-like. 

Remark 1. At this point, no interesting non-probabilistic results has been proven by our 
constructions. Although, Corollary 3.5 can be viewed as a fancy way to show that for 
every suitable type-1 f there is A G 5 such that s(x; A) fix)- 

Remark 2. Corollary 3.5 leads us to the thought that to use f'{n) is the most natural way 
to construct spaces in order to study sequences with asymptotic density approachable by 
/. In fact, it makes us rethink our previous definition of f(n. A), but that is not necessary 
since in the majority of interesting cases we actually have /'{x) x f[x)/x. A precise 
statement of this may be found in N. H. Binham’s et al. book (for instance. Theorem 

1.7.2 and Proposition 2.10.3 of [2]). 

3.2 Integral formula for E(rrf) 

We now focus on investigate how rrf(n;a;) works. As one could expect, it is in 
some sense easier to deal with this probabilistic version of r^ rather than with the non- 
probabilistic Trf. Our goal is to estimate E(rrf(n; ca)) in convenient spaces (5, (an)n^o)- We 
start with a technical lemma that will help us avoid some problems with correlation while 
estimating E(rrf). 

Lemma 3.6. Let A £ S, 1 an integer and n £ Then, for all k ^ 0, 



where l-n/dik) = 1 if k = n/d and 0 otherwise. 

Proof. The hrst side is obvious. For the second, we use the dehnition and follow a purely 
combinatorial approach: 


rd{n;A[j{k})= ^ r{ki] Au {k})r{k 2 ] Au {k})... r{kd; Au {k}) 


ki+k2+...+kj^=n 


^ r{ki] A)r{k 2 ] A)... r{kd; A) + ^ 1 


—n 


ki^k2^...^kfi=n 
k^{ki,--,kd} 
kj^k kjSA 



As i^d%}) bounds the binomial coefficient, we deduce the right side. 


In the proof of the following proposition we will use 2'^ instead of ()) for in despite 
of being a worse bound it works anyway. 
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Proposition 3.7 (Summation formula for ]E(rrf)). Let f be a type-2 function. If {otn)n'^o 
is a sequence of probabilities with an ~ f'{n), then, in {S, {an}n^o) holds, for d^2, 

E(rrf(n;a;))= ^ ¥.{rd-i{k;uj))f'{n-k) + 0{f{n)f{nf~‘^). (3.1) 

a<k^n—a 

Proof. Consider the following; 

E(rd(re; w)) = E ^ rd-i{k; uj)r{n - A:; w) = ^ E(rrf_i(A:; uj)r{n - t, ui)) 


V k^n 


k^n 


Given that the result is valid for d = 2 by independence, we apply induction on d. 
Now we have a little problem with independence since rd-i{k]uj) and r(n — k]uj) are in 
general not independent for d ^ 3. However, since r{n — k‘,u}) is a Bernoulli trial, we have 

E(rrf_i(/c; ui)r{n — k;u)) = E(r(n — k; uj))K{rd-i{k; oj) \ r{n — k]U}) = 1). (3.2) 

One can deduce from Lemma 3.6 applied for ca, d — 1 and k that taking n — k: 

E(r(n - k] uj))E{rd-i{k; to)) 

^ E(r(n — k\co))E{rd-i{k-, to) \ r{n — k]uj) = 1) (3-3) 

^ E(r(n - k-,uj)) | E(rd_i(fc;a;)) -t 2"*“^ ^ E{rd-i-i{lk -{I - l)n;a;)) -t lfc/(d-i)(n - k) 

At the moment, we want to estimate '^^.^.^E[rd-i{k\oj)r{n — k]uj)). Taking the sum¬ 
mation on the last part of the upper bound in the previous inequality, we use the induction 
hypothesis to get, for d ^ 3, 


E 

2^-1 


k^n 

V 

ls£Z<d-l 

7 

II 

E 

yyE(rrf_;_ 


IfiKd- 

-1 k^n 


E 

yyE(rrf_;_ 


k^n 


k^n 


k^n 


< E E Kird-2{k]uj)) < E(srf_2(n;a;)), 


1 k^n 

which lead us to conclude, by Corollary 3.5 and Theorem 2.2, that this term is O (/(re)'^“^). 

Thus, these bounds on K{rd-i{k‘,uj) \ r{n — k] to) = 1) along with /' being monotonic 
0-regularly varying allows us to deduce from (3.2) and (3.3), for d ^ 3, 

^ E(rd_i(fc;w)r(n — fc; w)) = ^ E(rd-i(fc; a;))E(r(n - fc; a;))-|- ^ E{rd-i{k-Lo)r{n - k-to)) 

k<.nf2 n/2^k^n 


k^n 


'^E{rd-i{k]U}))E{r{n - t,u])) + 0{f'{n)f{n) 


d-2\ 


k^n 


Therefore, bounding the tails just like (2.2) and (2.3), we deduce for d ^ 2: 
E{rd{n-,u})) = ^ E{rd-i{k;uj))f{n-k)+0{f'{n)f{n)'^~'^). 

a<k^n—a 
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Theorem 3.8. Under the same conditions of Proposition 3.7 holds, for d'^1, 

E(rrf(n;w)) x /'(n)/(n)'^"^ 

Proof. The result is valid for d = 1 by hypothesis, so we apply induction. Taking d ^ 2 
and assuming the statement valid for d — 1, by Theorem 3.7: 

E(rrf(n;a;))x ^ E{rd-i{k-,uj))f'{n - k) 

a<k^n—a 

- E /(^)"”'/'(fc)/'(n - k) 

a<k^n—cx 

Now let us bound this sum. Firstly, given that f' is O-regularly varying, monotonic 
and bounded, when d = 2 we have 

E f'{k)f'{n-k) = E f'{k)f'{n-k)+ E f'ik)f'{n-k) 

a<k^n—a a<k^n/2 n/2<k^n—a 

fin/2) E fik)+fin/2) E f'i^-k) 
a<k^n/2 n/2<k^n—a 

< /'(n/2)/(n/2) x fin) fin). 

Thus: 

E fik)'^~^fik)fin - A:) < E fik)fin - A:) < /'(n)/(n)'^"^ (3.4) 

a<k^n—a a<k^n—a 

and 


E fik)'^-^fik)fin-k)-> fin) E /(fc)""Y(n - A:) 

a<k^n—a n/2<k^n—a 

> fin) fin/2)'^-'^ E f'i^ - k) 

n/2<k^n—a 

» fin)fin/2)<^-^ » fin)fin)<^-\ (3.5) 


The statement then follows from (3.4) and (3.5). ■ 

Again, similar to the case of Theorem 2.6, one might expect to obtain, in view of the 
constants of (2.5), 

E(rrf(n;a;)) ~ dC'/,d/'(n)/(n)'^“\ (3.6) 

that is a much sharper result. If / is C'^ type-2, then by Euler-Maclaurin summation 

/ n—Q. 

fit)’^~‘^fit)fin - t)dt + 0(/'(n)/(n)‘^“2), 

Q;<,KS-n—a 

SO (3.6) depends whether or not (2.6) and 


3;->-+cxD /'(x)/(x)"*“^ 


(3.7) 
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exists for each 1 ^ m ^ d. If the above converges to some K > 0, then, calling Cf^rn the 
limits from (2.6) and given that 

^ J f {x - f'{t)dt = {m - 1) J f{x-t)’^~^f'{x-t)f{t)dt + f{a)"^~^f'{x-a) 

= (m-l) r fit) fix-t)dt + f{ar-^ fix-a) 

J a 

K{m- l)/'(x)/(x)'”"\ 

one can integrate both sides in order to get, assuming the convergence of (2.6), that 
K = for the integral on the right side must be Cf^rnf{x)^- As Cf^d = 0^=1 ^f,ki 

it would imply (3.6) immediately by a simple induction argument. 

One can show, for example, that in (5, holds E(r 2 (n; w)) ~ or that 

in (5, (1/log n)„ 5 , 3 ) holds E(r 2 (n;a;)) ~ n/(logre)^. Again, however, I could not decide 
a general condition over / that ensures the convergence of both (2.6) and (3.7), and my 
suspicions revolve around regularly varying type-2 functions. 

We conclude by presenting an integral formula for E(rrf) in certain spaces. 

Corollary 3.9 (Integral formula for E(rrf)). Let f be a C'^ type-2 function. If (an)nSsO is 
a sequence of probabilities with On ~ f'{n')> then, in (S, (an}n^o) holds, for df 2, 

rn—OL 

E(rrf(n;a;))= / "&{sd-iit\iu))f"in-t)dt + 0{f'{n)finY~‘^). 

J OL 

Proof. By Proposition 3.7: 

E(rd(n;a;))= ^ E(rrf_i(fc;a;))/'(n - A:)0(/'(n)/(n)'^“^). 

a<k^n—a 

Applying Abel’s summation formula: 

pn—OL 

Eird{n;uj)) = [E(sd_i(t; w))/'(n - t)]”““-t / Eisd-iit;uj))f"in-t)dt-\-0{f'in)f{nf~f 

J ot 

pn—a 

= / Eisd-iit-,uj))rin-t)dt + Oifin)finy-^). ■ 

J a 

3.3 /-regular asymptotic bases 

The discussion succeeding Theorem 3.8 leads us to the following thought. 

Definition 3.10 (/-regular asymptotic basis). Let / be an asymptotically defined differ¬ 
entiable real-valued function. An asymptotic h-basis B ^ S is said to be /-regular if, for 
every dfh, there are positive real constants Md such that 

rdin;B) ~ Mdf{n)f{n)'^~^. 

If instead of we have only ’x’, we call it 0(/)-regular. 

Proposition 3.11. Let B G S and f a type-2 function such that s {x-,B) X fix). If 
rfiin; B) x f in) f in)^~^ for some hf 2, then for every dfh holds 

rdin-,B) X /'(n)/(n)'^"/ 
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Proof. Since the case d = h holds by hypothesis, we apply induction for d > h. By the 
recursive formulas: 


rdin]B) = '^r{k;B)rd-i{n-k;B) ^ r{k; B)f’{n - k)f{n - kf 

k^n a<k^n—cx 

Using Abel’s summation formula: 

rd{n; B) x [s{t] B)f'{n - t)f{n - - j s{t; B) “ t)f{n - dt 

X - y f{t) dt + o{f'{n)f{nY~^). 

Now let us bound the integral. Firstly, we have: 


-J fit) 2^dt</(n) J dt ^ f (n) f {nf 

And then, by O-regular variation: 

-/ fit) (^■^fin-t)fin-t)'^-‘^^ dt:$> - j fit) (^■^fin-t)fin-t)‘^-‘^^ dt 

rril2 / J \ 

> fi'n/‘2) J J 

» /'(n/2)/(n/2)''-i x /'(n)/(n)''-i 
Finally, we conclude by induction that for every d ^ h holds 

rd(n;B) X/'(n)/(n)‘^"^ ■ 


To prove the existence of 0(/)-regular asymptotic bases we will use two largely known 
lemmas very recurrent in probabilistic methods in general. The first is an exponential 
bound taken from Appendix A of [1] credited to H. Chernoff, whereas the second is the 
Borel-Cantelli lemma, that can be found in chapter III of [7]. 

Lemma 3.12 (Chernoff bound). Let iXk)k^i be mutually independent Bernoulli trials 
and Sn := Yl’k=i Then given n ^ 1, for all e > 0 holds 


Pr 


- 1 


> e < 2e 


-<5E(S„) 


nsn) 

where 6 = min{(l + e) log(l + s) — e, e^/2}. 

Lemma 3.13 (Borel-Cantelli lemma). Let (X, 501,/r) be a measure space and iEn)n^i o, 
sequence of elements o/90I. IfhiT^n) < +oo, then 


n u =“■ 


Recapturing our (5, (an)n^o) setup, Chernoff bounds generally fails when we are deal¬ 
ing with rdin,u}) for d ^ 3, but for d = 2 it works great! In fact. 
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Theorem 3.14. Let f a type-2 function. If f'{x)f{x) » log(x), then there is a 
constant K ^ 1 such that in {S,{an)n'^o), with (an)n^o being a sequence of probabilities 
with an ~ Kf'{n), a.s. holds 

r 2 {n;uj) x /'(n)/(n). 

Proof. Firstly, consider 

P2(n,Lo) := r{k,uj)r{n — k,io). 
k<nl2 

Note that since r{k,uj)r{n — k,uj) can be seen as a Bernoulli trial with expected value 
akan-k in i<S, {an)n^o), this is the sum of mutually independent Bernoulli trials, so we 
can apply Lemma 3.12. But first, notice that 


r 2 (n,a;) = 2 p 2 {n,uj) + r{nj2,uj), 


(3.8) 


so by Theorem 3.8 we have 

E{p 2 {n;uj)) X f'{n)f{n). (3.9) 

So for sufficiently large n we must have E(p 2 (n; oj)) > ck log(n), for some fixed constant 
cx > 0 depending on K. With this in mind. Lemma 3.12 implies, as n —>■ -|-oo, that Ve > 0: 


Pr 


P 2 {n;uj) 

E{p2in;uj)) 



< n 


—(5c I, 


(3.10) 


for some <5 > 0 depending on e. 

To obtain p 2 {n\uj) x E(/? 2 (n;w)) a.s., it is sufficient to take some £ < 1 and ck > 0 
such that 5ck > 1, thus we can apply Borel-Cantelli’s lemma and our statement follows 
from (3.8) and (3.9). 

Taking e = .98, Lemma 3.12 says that <5 > .37, so we just need ck ^ 3. A quick look 
at Proposition 3.7 reveals that we may take ck = Kci, for K ^ 1 can be easily drawn 
out of the sum in (3.1) and it does not affect the lower bound obtained in Theorem 3.8. 
Therefore, it is sufficient to take iL ^ 3/ci. ■ 


When f'{x)f{x) >- log(x) we do not have to deal with careful choices of constants since 
the Chernoff’s bound in (3.10) is, in this case, O(n^) for every e > 0. 

This method, however, cannot be extended for the case f'{x)f{x) = o(log(x)), that is 
basically when f{x) = log(a;)^/^). The same occurs when we try to extend this for 

d ^ 3, requiring a very elaborated argument that avoids the problems with the Chernoff 
bounds. A proof for d = 3 may also be found in chapter 8 of [1]. 

We have then the following corollary of Theorem 3.14. 

Corollary 3.15. For every type-2 function f with f'{x)f{x) 3> log(x) there exists a 
Q{f)-regular asymptotic 2-basis. 

Proof. Follows immediately by Theorem 3.14 and Proposition 3.11. ■ 


To ensure the existence of an asymptotic h-basis B such that rk{n;A) log(n), 
P. Erdos and P. Tetali considered a particular space (5, (an)n^o) with a sequence of 
probabilities (an)n^o being such that 

log(n)^/^ 

for a certain constant C depending on h. This is, however, x ^ (x^/^ log(x)^/^), so with 
the aid of Proposition 3.11 we can restate Erdos-Tetali as follows: 
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Theorem 3.16 (adapted Erdos-Tetali). For every h ^ 2, 

there exists a log{x)^^^)-regular asymptotic h-basis. 


Proof. Corollary from Erdos-Tetali and Proposition 3.11. 


The methods used in [5] extends to general /. In fact, replacing the sequence of 
probabilities (an)n^o used in it, it is possible to show that for every type-2 function 
/ with f {x) f {x)^~^ ^ log(x) there exists a 0(/)-regular asymptotic /i-basis, reinforcing 
the role of ’’minimality” perpetrated by log(a:) in this framework. 

At the moment, it seems that 0(/)-regular asymptotic bases not only exist, but are 
also reasonably treatable. But 

Question. Does there exists f-regular asymptotic bases? If so, for what kind of f? 

We firmly believe that considering functions such that (2.6) and (3.7) converge for every 
integer m ^ 2, some methods similar to those described in [5] with an improved version 
of Proposition 3.11 can ensure both the existence and ’’treatability” of such sequences. 
Unfortunately, a proof or counterexample for this will not be found in this paper. 

Let us then digress a bit about a question raised by P. Erdos in 1956 in [4]. Actually, 
it became a conjecture that, just like Erdos-Turan, states what should be an asymptotic 
lower bound to the limsup of rh in asymptotic /i-bases. 


Erdos 1956 conjecture. If A ^ S is an asymptotic h-basis, then 


lim sup 

n—>-+oo 


rh{n;A) 

log(n) 


> 0. 


This is way stronger than Erdos-Turan, as it says that rh{n; A) is not only unbounded 
above, but diverges in a sense that is never o(log(n)). One may feel unlikely to believe it, 
after all, from where did log(n) popped out!? 

It actually came from the technical limitations of the ’’exponential bounds and Borel- 
Cantelli’s lemma” approach. Given d ^ 2, for any suitable slowly varying function L it is 
possible to construct a space where, in view Theorem 3.8, holds IE(rrf(n; w)) x L{x), e.g. 
(5, (an)n^o) with an ~ for some iL > 0. The standard approach of the 

exponential bounds, however, simply fails when L{x) = o(log(x)), suggesting that purely 
probabilistic methods may not suffice on their own to deal with such problems. 

0ther than that, it seems like there is no other immediate reason to believe on this 
statement. If only we could show that with some appropriate measure in S we can ensure 
rd to be ’’close enough” to E(rrf) with probability > 0 we would be able to ensure the 
existence of asymptotic bases that contradict Erdos’ conjecture. 

Anyway, as showed by M. B. Nathanson in [11], representation functions that grow 
arbitrarily slowly exist in a broader way when we take Z instead of Zj,o. In fact, Nathanson 
showed that Erdos-Turan is false for Z. 

If representation functions could grow arbitrarily slowly, then Erdos-Turan conjecture 
would be a maximal statement about the limsup of such functions. We finish this section 
with a version of Erdos’ conjecture in terms of 0(/)-regular asymptotic bases. 

Conjecture 3.17 (adapted Erdos 1956 conjecture). There are no Q{f)-regular asymptotic 
h-basis with 

/(x) = o(xVMog(x)'/"). 
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4 Remarks on Goldbach’s conjecture 


The discussion on the previous section may lead us to expect that if P is the set of 
primes, then for d ^ 2 we must have something like rrf(n;P) ~ log(n)'^. In fact, 

under the assumption of a weak version of the Generalized Riemann Hypothesis, G. H. 
Hardy and J. E. Littlewood showed in [8] that, for d ^ 3, when n runs through the natural 
numbers with same parity than d holds 


rrf(n;P) ~ 


2Cd / (p-l)‘^ + (-l)^(p-l) \ 

(d — 1)! log(n)‘^I (p — ) 

p\n 

p^3 


where Cd is a constant defined by 


Q;=n(l- 


(-1)" \ 


The assumption was: 


Weak GRH. There is an e > 0 such that for every Dirichlet character x holds: 

3 

If L{s, x) = 0, then Re(s) ^ ^ 

Actually this estimative has been rigorously proved by I. M. Vinogradov without any 
unproven assumption, although the case d = 2 remains (to this day!) unsolved, and not 
even the methods used by Hardy &: Littlewood on d ^ 3 assuming the full power of GRH 
works in this case. Anyway, it motivates 


Conjecture A (Hardy-Littlewood). As n runs through the even numbers, 


r 2 (n;P) ~ 2C2 


n 

log(n)2 



where C 2 is the constant 


C.;=n(l 

p>3 


{p-ly 


Through an elementary combinatorial heuristic approach, we shall motivate a related 
conjecture about the behaviour of r 2 (n;P) that ’’miss” Hardy &: Littlewood’s conjecture 
only by a constant. Let us call it Gonjecture B. 


Conjecture B. As n runs through the even numbers, 

n n 


r2(n;P) 


(p(n) log(n)^ 

Indeed, for n —>■ +00 through the even numbers we have 


n 


<f{n) 


p\n 

p^3 


P 


p-1 




p\n 

p^3 


I 


{P - 1)^ 


n 

p\n 

p^3 


P- 1 

p-2 
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Given that C 2 is constant, one can see that 


ip{n) 


n 



p^3 


Thus we can state a Conjecture B’ that is slightly weaker but do not disagree with 
Hardy-Littlewood’s conjecture. 

Conjecture B’. As n runs through the even numbers, 



Note that both three conjectures imply that for sufficiently large N, all even numbers 
greater than N are writable as sum of two primes. It does not imply Goldbach conjecture, 
but, anyway, ’’asymptotic Goldbach conjecture” is until the moment an open problem as 
well. So let us motivate Conjecture B. 

4.1 Setup and motivation 

For fixed n > 1, the first that can be said about primes p, q that sum up to 2n is that, 
WLOG, p ^ n ^ q. Clearly p \ 2n implies p | 2n — p, so we must have gcd(p, 2n) = 1. 
Then we define some notation. 

Definition 4.1 (Goldbach’s function). Given n ^ 1 let g{n) := |{p ^ re : 2n — p G P}|. 

The notation on this section will be a bit different from the previous parts of the paper. 
Firstly, we will deal with g{n) instead of r 2 (re;P). It can be easily ’’undone” noticing that 
r2(2re;P) = 2g{n) — Ip(re), where Ip = r(-,P). 

Definition 4.2. For re ^ 1 we define: 

• An := {1 < k < n : gcd(A:, 2re) = 1}, 

• := {re < A: < 2re — 1 : gcd(A:, 2re) = 1}, 


and also: 


• K{n):=\An\, P(re):=|T„nP| 


Q{n) := \Bn nP|. 


Observe that k G An if and only if 2re — A: G Bn, so \An\ = \Bn\- It also implies that if 
2re — p G P, then p G G P or p G Bn G P. We can thus write K{n), P{n), Q{n) in terms 
of vr, where oo is the distinc prime factor counting function: 


K{n) = (p(2re)/2 — 1 
P{n) = 7r(re) — uj{2n) 
Q{n) = 7r(2re — 2) — 7r(re). 


With this we can deduce a nice result. 


Proposition 4.3. //7r(2re) — io{2n) > (p(2re)/2, then g{n) > 0. 
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Proof. Consider the following bijection: 

fn ■ ^ Bfi 

k ^ 2n — k 

If no p € n P is such that fn{p) G Bn H P, then fn{An D P) C Bn\{Bn Pi P). It 
follows: 

|/„(^„nP)| ^ |5„\(s„nP)| 

P{n) ^ K{n) — Q{n) 

P{n) + Q{n) ^ K{n) 

Therefore, by the contrapositive, if the last inequality was reversed then there would 
be a p € An H P such that fnip) £ Bn n P, that is, g{n) > 0. Finally: 

P{n) + Q{n) = 7r(2n — 2) — u{2n) ^ 7r(2n) — uj{2n) — 1 

So, in order to have P{n) + Q{n) > K{n), it is sufficient that: 

7r(2n) - a;(2n) - 1 > - 1 

TT{2n)-uj{2n)>^^^. ■ 

Because of the growth order of the functions involved, the inequality on previous 
proposition holds only to a finite number of cases, that cease to occur when 2n is near 
10®. A deeper study on this perhaps would reveal weaker conditions to guarantee an even 
number to be sum of two primes, but that will not be our intention in this section. 

4.2 The random variables g{n) 

Rewriting g in terms of the bijection fn used on the proof of Proposition 4.3: 

9 {n) = p) = ^ lp(2n-p) 

p^n n^p<2n 

= lp(2n — k) + lp(n) 

keBnPiF 

= +1pW 

keBnPiF 

= ^ lp(A:) + lp(n) 

kefn^^iBnClF) 

The difficult in to effectively understand g is exactly to count the primes on the pre¬ 
image of Bn n P by /„. An attempt to avoid this problem can be to choose a subset of Bn 
with Q{n) elements at random, that is: 

9{n) := ^ lIp(A:), 

kefn\J) 
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= {X C Bn: |X| = Q(n)}. 

Taking J at random, g(n) becomes a random variable. From a population of K{n) 
elements (the set P{n) are primes (the set H P). Now we draw Q{n) elements 
(the set fn^{J)) and want to count how many primes we got. That is it! The r.v. g{n) 
must follow a hypergeometric distribution^- with parameters K{n), P{n),Q{n). Thus 


Pr(5(n) = k) 


(P{n)\ /K{n)-P{n)\ 
\ k ) \ Q(n)—k ) 



defined for 

max{0, P(n) + Q{n) — K{n)} ^ k ^ m.m{P{n),Q{n)}, 

with Pr(^(n) = A;) = 0 for other values of k. Note that when P(n) + Q{n) > K{n) we have 
Pr( 5 (n) = 0) = 0, that is another way do prove Proposition 4.3. We synthesize all this 
in the notation g{n) ~ \i{K{n),P{n),Q{n)), which reads ”g{n) follows a hypergeometric 
distribution of parameters K{n), P{n),Q{n)”. 


Definition 4.4. For each n ^ 1, we define g{n) r-u HK{n),P{n),Q{n)). 


The way we just defined it makes the g{n) completely uncorrelated, although it is a 
little uncomfortable to assume this since {An)n^i is not a pairwise disjoint sequence. The 
study of how g{n) correlates with g{m) for n ^ m seems to be a fruitful way to obtain 
information about primes, but, again, it escapes our scope. 

Anyway, it is interesting to remark that as we defined g{n) in terms of a random subset 
J of An , one way to do this is to ” keep” J as if it were a set of ” real primes” and see how 
it affects g{m). Note that it would not change the counting functions K,P,Q. 

If you start this with ^(1) and then choose g{n) subjected to all ’’chosen Js” used to 
stipulate g{k) for 1 ^ fe < n, then you will always end up with the actual sequence of 
primes and actual values of g{n) — lp(n). This is just a straightforward remark on how 
primes are absolutely deterministic. 


4.3 Asymptotic behaviour of g{n) 

The following lemma can be easily deduced with known estimates of TT,(p,uj, both 
presented in Hardy & Wright’s number theory book (see chapter XVIII of [9]). We will 
omit the proof. 

Lemma 4.5. When n —>■ +oo.' 

(i) P{n) ~ Q{n) ~ n/log(n); 

(a) n/loglog(n) <C K{n) <C n. 

Also, K{n) < n for all n ^ 1. 

Let us explore now the hypergeometric distribution of g{n). Following [12], we may 
calculate the expected value and variance for each re ^ 1: 




P{n)Q{n) _ 
K{n) ’ 


■'■To the puzzled reader, we suggest a quick look at M. Skala’s paper [ 12 ]. 
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• Yav{g{n)) 


P{n)Q{n){K{n) - P{n)){K{n) - Q{n)) 
K{nY{K{n) - 1) 


Recording that K{n) = (^(2n)/2 — 1, it can be deduced from Lemma 4.5 that: 

2n2 


\ai{g{n)) ~ E 5 (n) 


ip{2n) log(n)2 


We now state a tail inequality to the hyper geometric distribution presented in [12] 
credited to W. Hoeffding and V. Chvatal. A proof may be found on Chvatal’s original 
paper (see [3]). 


Lemma 4.6 (Hypergeometric tail inequality). Let X ~ h(Ai, M, n). For all t ^ 0 holds 

Pr(|A - E(A)| ^ tn) ^ 


The setup to deduce our conjecture is almost done, the only remaining thing is to set 
a convenient probability space to apply Borel-Cantelli’s lemma (see Lemma 3.13). The 
naivest thing we can do is to simply take the product space with the product measure of 
each g{n) for n ^ 1. And that is exactly what we are going to do! 

In the canonical product space, {g{n))n^i will be a sequence of pairwise independent 
r.v.s, exactly what we said that could be uncomfortable to some people. Well, this will 
only hurt a bit! 

Theorem 4.7. Let (H,/”, P) he the canonical product space of the discrete probability 
spaces of g{n) for n ^ 1. In this space, a.s. g{n) ~ ^g{n) as n ^ +oo. 

Proof. Equivalently to our statement, we will show that for all e > 0 holds 



Our goal is to apply Borel-Cantelli lemma, so we start by applying Lemma 4.6: 
P {\~g{n) - E~g{n)\ ^ tQ{n)) ^ 


Now dividing what is inside by Eg{n) = P{n)Q{n)/K{n): 


P 


g(n} 


Eg{n) 


- 1 




tK{n) 

P{n) 




Using Lemma 4.5, we may use that K[n) < n and, for sufficiently large n, that 
P{n) > n/21og(re) and Q{n) > n/21og(n). So: 

9{n) 


P 


Eg{n] 


- 1 




nt 


n/2 log(n) 


< e 


—nP/2 log(n) 


For each n ^ 2, let us change t for l/21og(n)^, in order to obtain: 


P 




Eg{n) 


- 1 




1 


log(n) 


< e 


—n/8 log(n)® 


Observe that lim 1/logn = 0, and also clearly e "•/siogW® < +c)o, thus for 

n—>-oo z 

every e > 0: 

9{n) 


Er 

n '^1 


Eg{n] 


- 1 


^ e ] < + 00 , 


and our statement follows applying Borel-Cantelli lemma. 
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This lead us to expect that the ’’the real 5 ” must behave like this, that is, 


g{n) ~ 


2'n? 

ip{2n) log(n)^ ’ 


which translates exactly as the Conjecture B stated on the beginning of the section. 
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